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Abstract 



We propose an adaptive, two steps strategy, for the estimation of mixed qubit states. We show that the 
strategy is optimal in a local minimax sense for the trace norm distance as well as other locally quadratic 
figures of merit. Local minimax optimality means that given n identical qubits, there exists no estimator 
which can perform better than the proposed estimator on a neighborhood of size n -1 / 2 of an arbitrary 
state. In particular, it is asymptotically Bayesian optimal for a large class of prior distributions. 

We present a physical implementation of the optimal estimation strategy based on continuous time 
measurements in a field that couples with the qubits. 

The crucial ingredient of the result is the concept of local asymptotic normality (or LAN) for qubits. 
This means that, for large n, the statistical model described by n identically prepared qubits is locally 
equivalent to a model with only a classical Gaussian distribution and a Gaussian state of a quantum 
harmonic oscillator. 

The term 'local' refers to a shrinking neighborhood around a fixed state po- An essential result is that 
the neighborhood radius can be chosen arbitrarily close to n -1 / 4 . This allows us to use a two steps 
procedure by which we first localize the state within a smaller neighborhood of radius n~ 1 / 2+c , and then 
use LAN to perform optimal estimation. 



1 Introduction 

State estimation is a central topic in quantum statistical inference [521 15T1 [25] . In broad terms the 
problem can be formulated as follows: given a quantum system prepared in an unknown state p, one 
would like to reconstruct the state by performing a measurement M whose random result X will be used 
to build an estimator p(X) of p. The quality of the measurement-estimator pair is given by the risk 

R p (M,p)=E(d(p(X),p) 2 ), (1.1) 
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where d is a distance on the space of states, for instance the fidelity distance or the trace norm, and the 
expectation is taken with respect to the probability distribution P^ of X, when the measured system 
is in state p. Since the risk depends on the unknown state p, one considers a global figure of merit by 
either averaging with respect to a prior distribution n (Bayesian setup) 

R a (M,p)=[ir(dp)R p (M,p), (1.2) 

or by considering a maximum risk (pointwise or minimax setup) 

i? max (M, p) = sup R P (M, p). (1.3) 
p 

An optimal procedure in either setup is one which achieves the minimum risk. 

Typically, one measurement result does not provide enough information in order to significantly narrow 
down on the true state p. Moreover, if the measurement is "informative" then the state of the system 
after the measurement will contain little or no information about the initial state [35] and one needs 
to repeat the preparation and measurement procedure in order to estimate the state with the desired 
accuracy. 

It is then natural to consider a framework in which we are given a number n of identically prepared 
systems and look for estimators p n which are optimal, or become optimal in the limit of large n. This 
problem is the quantum analogue of the classical statistical problem [3D] of estimating a parameter 9 
from independent identically distributed random variables X\ , . . . , X n with distribution Pg , and some 
of the methods developed in this paper are inspired by the classical theory. 

Various state estimation problems have been investigated in the literature and the techniques may be 
quite different depending on a number of factors: the dimension of the density matrix, the number 
of unknown parameters, the purity of the states, and the complexity of measurements over which one 
optimizes. A short discussion on these issues can be found in section [2] 

In this paper we give an asymptotically optimal measurement strategy for qubit states that is based 
on the technique of local asymptotic normality introduced in [221 123j . The technique is a quantum 
generalisation of Le Cam's classical statistical result [3DJ , and builds on previous work of Hayashi and 
Matsumoto [25], [29] . We use an adaptive two stage procedure involving continuous time measurements, 
which could in principle be implemented in practice. The idea of adaptive estimation methods, which has 
a long history in classical statistics, was introduced in the quantum set-up by [7], and was subsequently 
used in [5TJ (3D] [3D] . The aim there is similar: one wants to first localize the state and then to perform 
a suitably tailored measurement which performs optimally around a given state. A different adaptive 
technique was proposed independently by Nagaoka [46j and further developed in [16] . 

In the first stage, the spin components o~ Xl a y and o~ z are measured separately on a small portion n <C n 
of the systems, and a rough estimator p n is constructed. By standard statistical arguments (see Lemma 
12. 1[) we deduce that with high probability, the true state p lies within a ball of radius slightly larger 
than n -1 / 2 , say n" 1 / 2+c with e > 0, centered at p n . The purpose of the first stage is thus to localize the 
state within a small neighborhood as illustrated in Figure [1] (up to a unitary rotation) using the Bloch 
sphere representation of qubit states. 

This information is then used in the second stage, which is a joint measurement on the remaining n — h 
systems. This second measurement is implemented physically by two consecutive couplings, each to a 
bosonic field. The qubits are first coupled to the field via a spontaneous emission interaction and a 
continuous time heterodyne detection measurement is performed in the field. This yields information 
on the eigenvectors of p. Then the interaction is changed, and a continuous time homodyne detection is 
performed in the field. This yields information on the eigenvalues of p. 
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Figure 1: After the first measurement stage the state p lies in a small ball centered at p. 



We prove that the second stage of the measurement is asymptotically optimal for all states in a ball of 
radius n _1 / 2+ '' around p n . Here r\ can be chosen to be bigger that e > implying that the two stage 
procedure as a whole is asymptotically optimal for any state as depicted in Figure [2] 




Figure 2: The smaller domain is the localization region of the first step. The second stage 
estimator is optimal for all states in the bigger domain. 



The optimality of the second stage relics heavily on the principle of local asymptotic normality or LAN, 
see [49], which we will briefly explain below, and in particular on the fact that it holds in a ball of radius 
rt _1/ ' 2+I ' around p~ n rather than just n -1 / 2 as it was the case in [22] . 

Let po be a fixed state. We parametrize the neighboring states as p u /^, where u = (u x ,u y ,u z ) G M 3 
is a certain set of local parameters around po. Then LAN entails that the joint state p" := P®J^ of 
n identical qubits converges for n — + oo to a Gaussian state of the form N u ® <fr u , in a sense explained 
in Theorem 13.11 By iV u we denote a classical one-dimensional normal distribution centered at u z . The 
second term </> u is a Gaussian state of a harmonic oscillator, i.e. a displaced thermal equilibrium state 
with displacement proportional to (u x ,u y ). We thus have the convergence 

p»^N u ®0 u , 

to a much simpler family of classical - quantum states for which we know how to optimally estimate the 
parameter u [32J, [55] . 

The idea of approximating a sequence of statistical experiments by a Gaussian one goes back to Wald 
[53j . and was subsequently developed by Le Cam [40) who coined the term local asymptotic normality. 
In quantum statistics the first ideas in the direction of local asymptotic normality for d-dimensional 
states appeared in the Japanese paper [57], as well as [25] and were subsequently developed in [25] • In 
Theorem 13.11 we strengthen these results for the case of qubits, by proving a strong version of LAN in 
the spirit of Le Cam's pioneering work. We then exploit this result to prove optimality of the second 



3 



stage. A different approach to local asymptotic normality has been developed in [23j to which we refer 
for a more general exposition on the theory of quantum statistical models. A short discussion on the 
relation between the two approaches is given in the remark following Theorem 13. 11 

From the physics perspective, our results put on a more rigorous basis the treatment of collective states 
of many identical spins, the keyword here being coherent spin states [33] . Indeed, it has been known 
since Dyson [13] that n spin-i particles prepared in the spin up state 1 f)®" behave asymptotically as 
the ground state of a quantum oscillator, when considering the fluctuations of properly normalized total 
spin components in the directions orthogonal to z. We extend this to spin directions making an "angle" 
of order n -1 / 2+ '' with the z axis, as illustrated in Figure [31 as well as to mixed states. We believe that a 
similar approach can be followed in the case of spin squeezed states and continuous time measurements 
with feedback control [19] . 




Figure 3: Total spin representation of the state of n 1 spins: the quantum fluctuations of the x 
and y spin directions coincide with those of a coherent state of a harmonic oscillator. 



In Theorem 14.11 we prove a dynamical version of LAN. The trajectory in time of the joint state of the 
qubits together with the field converges for large n to the corresponding trajectory of the joint state 
of the oscillator and field. In other words, time evolution preserves local asymptotic normality. This 
insures that for large n the state of the qubits "leaks" into a Gaussian state of the field, providing a 
concrete implementation of the convergence to the limit Gaussian experiment. 

The punch line of the paper is Theorem 16 . 1 1 which says that the estimator p n is optimal in local minimax 
sense, which is the modern statistical formulation of optimality in the frequentist setup [49j . Also, its 
asymptotic risk is calculated explicitly. 

The paper is structured as follows: in section [21 we show that the first stage of the measurement 
sufficiently localizes the state. In section [31 we prove that LAN holds with radius of validity n~ 1 / 2+I ', 
and we bound its rate of convergence, sections [4] and \5\ are concerned with the second stage of the 
measurement, i.e. with the coupling to the bosonic field and the continuous time field-measurements. 
Finally, in section [51 asymptotic optimality of the estimation scheme is proven. 

The technical details of the proofs are relegated to the appendices in order to give the reader a more 
direct access to the ideas and results. 



2 State estimation 

In this section we introduce the reader to a few general aspects of quantum state estimation after which 
we concentrate on the qubit case. 
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State estimation is a generic name for a variety of results which may be classified according to the dimen- 
sion of the parameter space, the kind or family of states to be estimated and the preferred estimation 
method. For an introduction to quantum statistical inference we refer to the books by Helstrom [31] and 
Holevo [32] and the more recent review paper [B]. The collection 2B] is a good reference on quantum 
statistical problems, with many important contributions by the Japanese school. 

For the purpose of this paper, any quantum state representing a particular preparation of a quantum 
system, is described by a density matrix (positive selfadjoint operator of trace one) on the Hilbert space 
Ti associated to the system. The algebra of observables is B(H), and the expectation of an observable 
a e B(H) with respect to the state p is Tr (pa). A measurement M with outcomes in a measure space 
(X, S) is completely determined by a a-additive collection of positive selfadjoint operators M(A) on H, 
where A is an event in S. This collection is called a positive operator valued measure. The distribution 
of the results X when the system is in state p is given by P p (A) = Tr(pM(A)). 

We are given n systems identically prepared in state p and we are allowed to perform a measurement 
M n whose outcome is the estimator p n as discussed in the Introduction. 

The dimension of the density matrix may be finite, such as in the case of qubits or d-levels atoms, or 
infinite as in the case of the state of a monochromatic beam of light. In the finite or parametric case 
one expects that the risk converges to zero as n _1 and the optimal measurement-estimator sequence 
(M n ,p n ) achieves the best constant in front of the n _1 factor. In the non-parametric case the rates of 
convergence are in general slower that n~ x because one has to simultaneously estimate an infinite number 
of matrix elements, each with rate n _1 . An important example of such an estimation technique is that 
of quantum homodyne tomography in quantum optics [52] , This allows the estimation with arbitrary 
precision [12] [42] [41] of the whole density matrix of a monochromatic beam of light by repeatedly 
measuring a sufficiently large number of identically prepared beams [481 1471 156j . In [TJ [5] it is shown 
how to formulate the problem of estimating infinite dimensional states without the need for choosing a 
cut-off in the dimension of the density matrix, and how to construct optimal minimax estimators of the 
Wigner function for a class of "smooth" states. 

If we have some prior knowledge about the preparation procedure, we may encode this by parametrizing 
the possible states as p = pe with 9 € some unknown parameter. The problem is then to estimate 9 
optimally with respect to a distance function on O. 

Indeed, one of the main problems in the finite dimensional case is to find optimal estimation procedures 
for a given family of states. It is known that if the state p is pure or belongs to a one parameter family, 
then separate measurements achieve the optimal rate of the class of joint measurements [45] . However 
for multi-dimensional families of mixed states this is no longer the case and joint measurements perform 
strictly better than separate ones [2T] , 

In the Bayesian setup, one optimizes R n (M n , p n ) for some prior distribution it. We refer to [36"I PHI 1351 
HH [H H [H for the pure state case, and to [TT] (S3 S3 (35J HEZ] S] for the mixed state case. The 
methods used here are based on group theory and can be applied only to invariant prior distributions and 
certain distance functions. In particular, the optimal covariant measurement in the case of completely 
unknown qubit states was found in Fj] [SpJ but it has the drawback that it does not give any clue as to 
how it can be implemented in a real experiment. 

In the pointwise approach [26, 30, 21, 7, 17, 45, 6, 29J one tries to minimize the risk for each unknown 
state p. As the optimal measurement-estimator pair cannot depend on the state itself, one optimizes the 
maximum risk R m a.x(M n , p n ), (see (|1.3[1 ). or a local version of this which will be defined shortly. The 
advantage of the pointwise approach is that it can be applied to arbitrary families of states and a large 
class of loss functions provided that they are locally quadratic in the chosen parameters. The underlying 
philosophy is that as the number n of states is sufficiently large, the problem ceases to be global and 
becomes a local one as the error in estimating the state parameters is of the order n -1 / 2 . 
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The Bayesian and pointwise approaches can be compared [20j . and in fact for large n the prior distribution 
7r of the Bayesian approach becomes increasingly irrelevant and the optimal Bayesian estimator becomes 
asymptotically optimal in the minimax sense and vice versa. 

2.1 Qubit state estimation: the localization principle 

Let us now pass to the quantum statistical model which will be the object of our investigations. Let 
p G M^ifC) be an arbitrary density matrix describing the state of a qubit. Given n identically prepared 
qubits with joint state we would like to optimally estimate p based on the result of a properly chosen 
joint measurement M n . For simplicity of the exposition we assume that the outcome of the measurement 
is an estimator p n 6 M%(£.). In practice however, the result X may belong to a complicated measure 
space (in our case the space of continuous time paths) and the estimator is a function of the "raw" data 
Pn '■= Pn(X). The quality of the estimator at the state p is quantified by the risk 

R p (M n ,p n ) :=E p (d{p,p n ) 2 ), 

where d is a distance between states. The above expectation is taken with respect to the distribution 
P p {dx) := Tr(pM(dx)) of the measurement results, where M(dx) represents the associated positive 
operator valued measure of the measurement M. In our exposition d will be the trace norm 

\\pi - P2II1 := Tr(|pi - p 2 \), 

but similar results can be obtained using the fidelity distance. The aim is to find a sequence of mea- 
surements and estimators (M n ,p n ) which is asymptotically optimal in the local minimax sense: for any 
given p 

limsup sup nR p (M n , p n ) < limsup sup nR p (N n ,p n ), 

n-*oo I p— po || i<ri- 1 / 2 +« n->oo || p - po || 1 <„-i/2+e 

for any other sequence of measurement-estimator pairs (N n ,p n ). The factor n is inserted because 
typically R p (M n , p n ) is of the order 1/n and the optimization is about obtaining the smallest constant 
factor possible. The inequality says that one cannot find an estimator which performs better that p n 
over a ball of size n _1 / 2+e centered at po, even if one has the knowledge that the state p belongs to that 
ball! 

Here, and elsewhere in the paper e will appear in different contexts, as a generic strictly positive number 
and will be chosen to be sufficiently small for each specific use. At places where such notation may be 
confusing we will use additional symbols to denote small constants. 

As set forth in the Introduction, our measurement procedure consists of two steps. The first one is to 
perform separate measurements of o~ x , o~ y and o~ z on a fraction n = n(n) of the systems. In this way we 
obtain a rough estimate p n of the true state p which lies in a local neighborhood around p with high 
probability. The second step uses the information obtained in the first step to perform a measurement 
which is optimal precisely for the states in this local neighborhood. The second step ensures optimality 
and requires more sophisticated techniques inspired by the theory of local asymptotic normality for qubit 
states [H]. We begin by showing that the first step amounts to the fact that, without loss of generality, 
we may assume that the unknown state is in a local neighborhood of a known state. This may serve 
also as an a posteriori justification of the definition of local minimax optimality. 

Lemma 2.1. Let denote the measurement of the o~i spin component of a qubit with i = x,y, z. We 
perform each of the measurements Mi separately on n/3 identically prepared qubits and define 

Pn=~(l + Ta), if |f|<l, 
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where f = (r x , r y ,f z ) is the vector average of the measured components. If \f\ > 1 then we define p n as 
the state which has the smallest trace distance to the right hand side expression. Then for all e € [0, 2], 
we have 

P(l|Pn-p|li > in 2 *' 1 ) < Gexpt-inr^- 1 ), Vp. 

Furthermore, for any < k < e/2, if n = n x ~ K , the contribution to the risk K(\\p n — p\\\) brought by the 
event E = [ \\p n — p\\\ > \^3n~ 1 / 2+€ ] satisfies 

E ( \\p„ - p\\Ixe ) < 24exp(-in 2e - K ) = o(l). 

Proof. For each spin component Ci we obtain i.i.d coin tosses Xi with distribution P(Xi = ±1) = 
(1 ± ri)/2 and average r^. 

Hoeffding's inequality [50] then states that for all c > 0, we have P(|Xi — X\ 2 > c) < 2exp(— ^nc). By 
using this inequality three times with c = n 2e_1 , once for each component, we get 

P ^2 \h - n\ 2 > 3n 2£ -^j < Qe^-lnn 2 '- 1 ) Vp, 

which implies the statement for the norm distance since \\p n — p\\\ = \ fi — ri\ 2 . The bound on 
conditional risk follows from the previous bound and the fact that \\p — p n \\i < 4. 

□ 

In the second step of the measurement procedure we rotate the remaining n — h qubits such that after 
rotation the vector f is parallel to the z-axis. Afterwards, we couple the systems to the field and perform 
certain measurements in the field which will determine the final estimator p n . The details of this second 
step are given in sections HI and but at this moment we can already prove that the effect of errors in 
the the first stage of the measurement is asymptotically negligible compared to the risk of the second 
estimator. Indeed by Lemma I2TT1 we get that if n = n 1_K , then the probability that the first stage gives 
a "wrong" estimator (one which lies outside the local neighborhood of the true state) is of the order 
exp(— in 2e_K ) and so is the risk contribution. As the typical risk of estimation is of the order 1/n, we 
see that the first step is practically "always" placing the estimator in a neighborhood of order n _1 / 2+e 
of the true state p, as shown in Figure^ In the next section we will show that for such neighborhoods, 
the state of the remaining n — n systems behaves asymptotically as a Gaussian state. This will allow us 
to devise an optimal measurement scheme for qubits based on the optimal measurement for Gaussian 
states. 



3 Local asymptotic normality 

The optimality of the second stage of the measurement relies on the concept of local asymptotic normality 
[5^1 - After a short introduction, we will prove that LAN holds for the qubit case, with radius of 
validity n~ l / 2+r > for all ?y € [0, 1/4). We will also show that its rate of convergence is 0(n~ 1 / 4+71+e ) for 
arbitrarily small e. 

3.1 Introduction to LAN and some definitions 

Let po be a fixed state, which by rotational symmetry can be chosen of the form 

"=(o i°m)< (3 ' 1} 
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for a given | < p < 1. We parametrize the neighboring states as Pu/^n wri ere u = (u x , u y , u z ) £ K 3 such 
that the first two components account for unitary rotations around po, while the third one describes the 
change in eigenvalues 

Pv:=f/(v)(^ x _l_ vz )U(,T, (3.2) 

with unitary U(v) := exp(i(v x <r x + v y a y )). The "local parameter" u should be thought of, as having a 
bounded range in R 3 or may even "grow slowly" as ||u|| < n v . 

Then, for large n, the joint state p" := P®"^ of n identical qubits approaches a Gaussian state of the 
form N u <g> (f> u with the parameter u appearing solely in the average of the two Gaussians. By N u we 
denote a classical one-dimensional normal distribution centered at u z which relays information about 
the eigenvalues of pu/^/^j- The second term </> u is a Gaussian state of a harmonic oscillator which is a 
displaced thermal equilibrium state with displacement proportional to (u x ,u y ). It contains information 
on the eigenvectors of p u /ys- We thus have the convergence 

to a much simpler family of classical - quantum states for which we know how to optimally estimate 
the parameter u. The asymptotic splitting into a classical estimation problem for eigenvalues and a 
quantum one for the eigenbasis has been also noticed in [4] and in [29] , the latter coming pretty close to 
our formulation of local asymptotic normality. 

The precise meaning of the convergence is given in Theorem 13.11 below. In short, there exist quantum 
channels T n which map the states P®"^ into iV u ® <f) u with vanishing error in trace norm distance, and 
uniformly over the local parameters u. From the statistical point of view the convergence implies that a 
statistical decision problem concerning the model p" can be mapped into a similar problem for the model 
N u £g) (f> u such that the optimal solution for the latter can be translated into an asymptotically optimal 
solution for the former. In our case the problem of estimating the state p turns into that of estimating 
the local parameter u around the first stage estimator p n playing the role of po- For the family of 
displaced Gaussian states it is well known that the optimal estimation of the displacement is achieved 
by the heterodyne detection [3S] 023 1 while for the classical part it sufficient to take the observation as 
best estimator. Hence the second step will give an optimal estimator u of u and an optimal estimator 
of the initial state p n :— Pu/^i- The precise result is formulated in Theorem 16. II 

3.2 Convergence to the Gaussian model 

We describe the state N u £g> cf> u in more detail. N u is simply the classical Gaussian distribution 

N u := N(u z ,fi(l — p)), (3.3) 

with mean u z and variance p(l — p). 

The state <j) u is a density matrix on H = J-(C), the representation space of the harmonic oscillator. In 
general, for any Hilbert space [), the Fock space over t) is defined as 

oo 

Hb) ■= f) ® s • • • ® s f>, (3-4) 

71 = 

with £g) s denoting the symmetric tensor product. Thus ^-"(C) is the simplest example of a Fock space. 
Let 

0:=(l-p)Xyi*><*l> (3.5) 

k=Q 
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i H/-U11 CllCl^J UL U11C UOWllOLUl cinei ^» — 

For every a € C define the displaced thermal state 



be a thermal equilibrium state with \k) denoting the fc-th energy level of the oscillator and p = < 1. 



cf)(a) := D{a)cf>D{-a), 

where D(a) :— exp(aa* — ao) is the displacement operator, mapping the vacuum vector |0) to the 
coherent vector 

OO 

|a)=cxp(- a 2 /2)]T^=|fc). 



Jk\ 
k=o v /i - 

Here a* and a are the creation and annihilation operators on J-(C), satisfying [a, a*] = 1. The family 
<f) u of states in which we are interested is given by 



<j> u := cj>(y/2n-la u ), u G M 3 , (3.6) 
with cv u := — u y + iu x . Note that </> u does not depend on u z . 

We claim that the "statistical information" contained in the joint state of n qubits 

Pi ■■= P^, (3-7) 
is asymptotically identical to that contained in the couple (iV u , <f> u ). More precisely: 

Theorem 3.1. Let p" be the family of states (|3.2[) on the Hilbert space (C 2 ) 8 ", let N u be the family 
p.3p of Gaussian distributions, and let </) u be the family (|3.6p of displaced thermal equilibrium states of 
a quantum oscillator. Then for each n there exist quantum channels ( trace preserving CP maps ) 

T n : T((C 2 )® n ) -» L 1 (E) ® T(JT(C)), 
S n : L\R) ® T{F{<C)) -> T((C 2 )®") 

with T{Ti) the trace-class operators on Ti., such that, for any < 77 < 1/4 and any e > 0, 

sup ||7V u ® ( /» u -T„(p^)|| 1 = 0(n- 1 /4+> ) + £)] (3 . 8) 

||U||<711 

sup \\p»-S n (N u ®cl>»)\\ 1 = 0(n- 1 /' L+ *> +e ). (3.9) 

||u||<rai 

Moreover, for each 62 > £/iere exists a function f(n) of order 0(n -1 / 4+, ' +£ ) sucft <aat the above 
convergence rates are bounded by f(n), with f independent of p° as long as |^ — /x| > £2- 

Remark. Note that the equations (|3.8|) and (|3 .9(1 imply that the expressions on the left side converge to 
zero as n — > 00. Following the classical terminology of Le Cam [40], we will call this type of result strong 
convergence of quantum statistical models (experiments). Another local asymptotic normality result 
has been derived in [53] based on a different concept of convergence, which is an extension of the weak 
convergence of classical (commutative) statistical experiments. In the classical set-up it is known that 
strong convergence implies weak convergence for arbitrary statistical models, and the two are equivalent 
for statistical models consisting of a finite number of distributions. A similar relation is conjectured to 
hold in the quantum set-up, but for the moment this has been shown only under additional assumptions 

EH. 



These two approaches to local asymptotic normality in quantum statistics are based on completely 
different methods and the results are complementary in the sense that the weak convergence of [23] 
holds for the larger class of finite dimensional states while the strong convergence has more direct 
consequences as it is shown in this paper for the case of qubits. Both results are part of a larger effort to 
develop a general theory of local asymptotic normality in quantum statistics. Several extensions are in 
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order: from qubits to arbitrary finite dimensional systems (strong convergence), from finite dimensional 
to continuous variables systems, from identical system to correlated ones, and asymptotic normality in 
continuous time dynamical set-up. 

Finally, let us note that the development of a general theory of convergence of quantum statistical 
models will set a framework for dealing with other important statistical decision problems such as 
quantum cloning [54] and quantum amplification |10) . which do not necessarily involve measurements. 

Remark. The construction of the channels T n ,S n in the case of fixed eigenvalues (u z — 0) is given 
in Theorem 1.1 of 22J. It is also shown that a similar result holds uniformly over ||u|| < C for any 
fixed finite constant C. In [23], it is shown that such maps also exist in the general case, with unknown 
eigenvalues. A classical component then appears in the limit statistical experiment. In the above re- 
sult we extend the domain of validity of these Theorems from "local" parameters ||u|| < C to "slowly 
growing" local neighborhoods ||u|| < n v with r\ < 1/4. Although this may be seen as merely a technical 
improvement, it is in fact essential in order to insure that the result of the first step of the estimation 
will, with high probability, fall inside a neighborhood ||u|| < rf for which local asymptotic normality 
still holds (see Figure [2]). 



Proof. Following [22] we will first indicate how the channels T n are constructed. The technical details 
of the proof can be found in Appendix 

The space (C 2 )®" carries two unitary representations. The representation 7r n of 5/7(2) is given by 
K n (u) = u® n for any u € 5/7(2), and the representation % n of the symmetric group S(n) is given by the 
permutation of factors 

7Tn("r) : V\ ® ■ ■ ■ ® v n h-> iV-i(i) ® • • • <8> u T -i(„), t £ S(n). 

As [7r„(u), 7r„(r)] = for all u e 5/7(2), r e S(n), we have the decomposition 

n/2 

n, n;,. (3.10) 

J=0,l/2 



The direct sum runs over all positive (half)-integers j up to n/2. For each fixed j, Tij = C 2j+1 is an 
irreducible representation Uj of 5/7(2) with total angular momentum J 2 = j(j + 1), and W n = C nj is 
the irreducible representation of the symmetric group S(n) with nj = ( ra /2_j) ~ (n/2-j-i) ■ ^ ne density 
matrix p" is invariant under permutations and can be decomposed as a mixture of "block" density 
matrices 

n/2 

Pn= PnA3)pln®-- (3.11) 
.7=0,1/2 } 

The probability distribution p n ,u(j) is given by [4]: 

PnAj) ■= (1 - Mu) f - J Mt +J+1 , (3-12) 
with /i u := fi + u z j \fn, p u :— 1 ~ Mu . We can rewrite p n u (j) as 

PnAj) ■= B n ,^(n/2+j) x K{j,n,n,u), (3.13) 

where 

B n Ak) ■■= (^\ k {l-v) n -\ k = 0,...,n 
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is a binomial distribution, and the factor K(j, n, /i, u) is given by 



Now K(j,n, n,u) = 1 + 0(n _1 / 2+e ) on the relevant values of j, i.e. the ones in an interval of order 
n i/2+e arounc i as i on g as ^ u j g bounded away from 1/2, which is automatically so for big n. As 
B n fMm (k) is the distribution of a sum of i.i.d. Bernoulli random variables, we can now use standard local 
asymptotic normality results [39] to conclude that if j is distributed according to p n ,u, then the centered 
and rescaled variable 

j 

9n •■=—;=- Vn(ti - 1/2), 

converges in distribution to a normal N u , after an additional randomization has been performed. The 
latter is necessary in order to "smooth" the discrete distribution into a distribution which is continuous 
with respect to the Lebesgue measure, and will convergence to the Gaussian distribution in total variation 
norm. 

The measurement "which block", corresponding to the decomposition (|3.11|) . provides us with a result 
j and a posterior state The function g n — g n (j) (with an additional randomization) is the classical 
part of the channel T n . The randomization consists of " smoothcning" with a Gaussian kernel of mean 
g n (j) and variance l/(2y / n), i.e. with r„.j :— (n 1 / 4 / \/tt) exp (— ^/n(x — g n (j)) 2 )- 

Note that this measurement is not disturbing the state p" in the sense that the average state after the 
measurement is the same as before. 

The quantum part of T n is the same as in |22j and consists of embedding each block state p"„ into the 
state space of the oscillator by means of an isometry Vj : Hj — > ^(C), 

Vj : |j,m) i-> |j - to), 

where {\j,m) : m = — j, . . . , j} is the eigenbasis of the total spin component L z := £^ ai l \ cf. equation 
(5.1) of |22| . Then the action of the channel T n is 

T n : /',,,;./ :/"„ ® — >-> Y. 1 '" "'■■> '■ Tn >i V jPlnVj ■ 



The inverse channel S n performs the inverse operation with respect to T n . First the oscillator state 
is "cut-off" to the dimension of an irreducible representation and then a block obtained in this way 
is placed into the decomposition (|3.10| (with an additional normalization from the remaining infinite 
dimensional block which is negligible for the states in which we are interested) . 

The rest of the proof is given in Appendix [A"l 

□ 



4 Time evolution of the interacting system 

In the previous section, we have investigated the asymptotic equivalence between the states p" and 
iV" u <S> 4> u by means of the channel T n . We now seek to implement this in a physical situation. The 
./V u -part will follow in section [531 the u -part will be treated in this section. 

We couple the n qubits to a Bosonic field; this is the physical implementation of LAN. Subsequently, 
we perform a measurement in the field which will provide the information about the state of the qubits; 
this is the utilization of LAN in order to solve the asymptotic state estimation problem. 
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In this section we will limit ourselves to analyzing the joint evolution of the qubits and field. The 
measurement on the field is described in section [5l 



4.1 Quantum stochastic differential equations 

In the weak coupling limit [18j the joint evolution of the qubits and field can be described mathematically 
by quantum stochastic differential equations (QSDE) 34J. The basic notions here are the Fock space, 
the creation and annihilation operators and the quantum stochastic differential equation of the unitary 
evolution. The Hilbert space of the field is the Fock space !F(L 2 (M.)) as defined in (|3.4[) . An important 
linearly complete set in JF(L 2 (IR)) is that of the exponential vectors 

oo OO 

e(f) := — /«» := — |/) n , / e L 2 (R), (4.1) 



with inner product (e(/), e(g)) = exp((/, g)). The normalized exponential states |/) := e~^'^ / 2 e(f) are 
called coherent states. The vacuum vector is |f2) := e(0) and we will denote the corresponding density 
matrix by $. The quantum noises are described by the creation and annihilation martingale 

operators A* t :— a*(x[o,t]) an d At := a(x[o,t]) respectively, where X[o,t] is the indicator function for [0,t] 
and 

a(f) ■ e{g) ^ {f,g)e(g)- 

The increments dAt := a>(x\o,t+dt]) — a (X[o.t]) and dA% play the role of non-commuting integrators in 
quantum stochastic differential equations, in the same way as the one can integrate against the Brownian 
motion in classical stochastic calculus. 

We now consider the joint unitary evolution for qubits and field defined by the quantum stochastic 
differential equation [311 IE] : 

dU n (t) = (a n dA* t - a* n dA t - \^a* n a n dt)U n {t), 

where U n (t) is a unitary operator on (C 2 )®" ® J-(L 2 (M.)), and 
1 " 

a n :=—==^°+\ 0+° := 1 ® ' ' ' ® + ia v)/ 2 ® ■ ■ ■ ® !) jn := (M - l/2)n. 

V 3 r ' 



k=l 

As we will see later, the "coupling factor" 1/v^n of the order -nT 1 / 2 , is necessary in order to obtain 
convergence to the unitary evolution of the quantum harmonic oscillator and the field. 

We remind the reader that the n-qubit space can be decomposed into irreducible representations as in 
(|3.10p , and the interaction between the qubits and field respects this decomposition 

n/2 

Un{t)= U j<n (t)®l, 
3=0,1/2 

where 1 is the identity operator on the multiplicity space Ti. J n , and 

Uj, n (t) : Hj ® T(L 2 (R)) -> Hj ® T(L 2 

is the restricted cocycle 

d.TL„(t) = (a, dAt - a*dA t - ^ 



dU j<n (t) = {a 3 dA* t - a*dA t - \-a*a.jdt)U^ n {t), (4.2) 
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with dj acting on the basis \j, m) of Jij as 



a o \j, m ) = Vj ~ m V (j + m+ l)/2j„ \j,m + l), 
a*j \j, m) = \/j - m + 1 Vj + m/ 2 Jn \j, m - 1). 

Remark. We point out that the lowering operator for L z acts as creator for our cut-off oscillator since 
the highest vector \j, j) corresponds by Vj to the vacuum of the oscillator. This choice does not have 
any physical meaning but is only related with our convention /i > 1/2. Had we chosen /i < 1/2, then 
the raising operator on the qubits would correspond to creation operator on the oscillator. 

By p. lip the initial state p® n decomposes in the same way as the unitary cocycle, and thus the whole 
evolution decouples into separate "blocks" for each value of j. We do not have explicit solutions to these 
equations but based on the conclusions drawn from LAN we expect that as n — > oo, the solutions will 
be well approximated by similar ones for a coupling between an oscillator and the field, at least for the 
states in which we are interested. As a warm up exercise we will start with this simpler limit case where 
the states can be calculated explicitly. 

4.2 Solving the QSDE for the oscillator 

Let a* and a be the creation and annihilation operators of a quantum oscillator acting on J-(C). We 
couple the oscillator with the Bosonic field and the joint unitary evolution is described by the family of 
unitary operators U(t) satisfying the quantum stochastic differential equation 

dU(t) = (adA* t - a*dA t - -a*adt)U{t). 

We choose the initial (un-normalized) state ip(Q) := e(z) <8 |f2), where z is any complex number, and we 
shall find the explicit form of the vector state of the system and field at time t: ip(t) := U(t)ip(0). 

We make the following ansatz: ip(t) — e(a t ) ® e(/t), where ft(s) ■= f(s)x[o,t](s) for some / E L 2 (R). 
For each (3 € C, g G L 2 (R), define I(t) := (e(/3) <g) e(g),ip(t)). We then have I(t) = exp(/3a(f) + {g,ft)), 
so that it satisfies 

dl(t) = (p&aQ) + g(t)f(t)) Kt)dt . (4.3) 

We now calculate 4il{t) with the help of the QSDE. Since A t e(f) = (X[o,i]> f) e (f): we have, for con- 
tinuous g, dA t e{g) = g{t)e(g)dt. However, since A s e(ft) is constant for s > t, we have dA t e(ft) = 0. 
Thus 

dl(t) = (e(/3) ® e(g), (adA* t - a*dA t - \a*adt)^{t)) = (g(t)a(t) - \(3a(t))I{t)dt . (4.4) 

Equating (|4.3|) with (|4.4|) for all t, j3 and continuous g, we find /(s) = a(s), 4^ct(t) = —^a(t). Thus 
a(t) = a(0)e-s*, f t (s) = a(0) X [o,t]( s ) e ~ hs with a (°) = z - 

In conclusion tp(t) = e(ze - ^*) ® e(ze~^ s X[o,t](s))- For later use we denote the normalized solution by 
</, z (t) : = U(t)\z) |0) = e-W 2 / 2 U(t)e(z) ® 

4.3 QSDE for large spin 

We consider now the unitary evolution for qubits and field: 

dU n (t) = (a n dA* t - a* n dA t - -a* n a n dt)U n (t). 
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It is no longer possible to obtain an explicit expression for the joint vector state ifj n (t) at time t. However 
we will show that for the states in which we are interested, a satisfactory explicit approximate solution 
exists. 

The trick works for an arbitrary family of unitary solutions of a quantum stochastic differential equation 
dU(t) = GdtU(t), and the general idea is the following: if ip(t) is the true state ip(t) = U(t)tp and £(t) is 
a vector describing an approximate evolution (tp(0) — £(0)) then with Uh dt := U(t + di)£/(i) _1 we get 

w + dt) - ^ + dt) = tp(t + dt) - ui +dt m + ui +dt m - m + ^) - + dt) 

= U l t+dt m) - f (t)] + [U(t + dt) - L/(t)]C/(t)- x e(t) 

By taking norms we get 

d\m)-m\\< \\G dt at) - dim. (4.5) 

The idea is now to devise a family £(t) such that the right side is as small as possible. 

We apply this technique block- wise, that is to each unitary Uj in (t) acting on Hj^>J 7 (L 2 (M.)) (see equation 
(|4.2[) ) for a "typical" j £ J n (see equation (|A.1[) ). By means of the isometry Vj we can embed the space 
Ttj into the first 2j + 1 levels of the oscillator and for simplicity we will keep the same notions as before 
for the operators acting on .F(C). As initial states for the qubits we choose the block states p" n . 

Theorem 4.1. Let pj n (t) = Uj >n (t) [pj n <g> $] U* n {t) be the j-th block of the state of qubits and field 
at time t. Let <^ u (£) := U(t) [</) u ® $] U(t)* be the joint state of the oscillator and field at time t. For 
any rj < 1/6, for any e > 0, 

sup sup sup ||p 3 u „(t) -</>"(£) lli =0(n- 1 / 4+ "+ e ,7i- 1 / 2+3 ' )+e ). (4.6) 

jeJn ||u||<n'J t 

Proof. From the proof of the local asymptotic normality Theorem 13.11 we know that the initial states of 
the two unitary evolutions are asymptotically close to each other 

sup sup \\pl n -r\\i = 0(n-^+% (4.7) 

j£j n ||u||<n.'' 

The proof consists of two estimation steps. In the first one, we will devise another initial state pj n which 
is an approximation of <p u and thus also of p"„: 

sup sup ||^„-0 u ||i=O(e-" 5 ). (4.8) 

j£j n ||u||<n» 

In the second estimate we show that the evolved states p™ n (t) and 4> u (t) are asymptotically close to each 
other 

sup sup sup||p" n (i)-0 u (t)||i =0(n- 1/A+r > +e ,n- 1 / 2+3r > +e ). (4.9) 
jej n H|<n" t 

This estimate is important because, the two trajectories are driven by different Hamiltonians, and in 
principle there is no reason why they should stay close to each other. 

^From (|4.7p , (|4.8p and (|4.9|) , and using triangle inequality we get 

sup sup sup||p"„(i)-0 u (i)||i =0(n- 1 / 4 +" +e ,n- 1 / 2 + 3 " +£ ). 

j£j n ||uj|<n'i t 

The following diagram illustrates the above estimates. The upper line concerns the time evolution of 
the block state p"„ and the field. The lower line describes the time evolution of the oscillator and the 
field. The estimates show that the diagram is "asymptotically commutative" for large n. 
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5(JT(C)) 5(JF(C)®JT) S(JF(C) ® JF) 

For the rest of the proof, we refer to Appendix [Bj 

□ 

We have shown how the mathematical statement of LAN (the joint state of qubits converges to a 
Gaussian state of a quantum oscillator plus a classical Gaussian random variable) can in fact be physically 
implemented by coupling the spins to the environment and letting them "leak" into the field. In the 
next section, we will use this for the specific purpose of estimating u by performing a measurement in 
the field. 



5 The second stage measurement 

We now describe the second stage of our measurement procedure. Recall that in the first stage a relatively 
small part n — 1 > k > 0, of the qubits is measured and a rough estimator p n is obtained. The 

purpose of this estimator is to localize the state within a small neighborhood such that the machinery 
of local asymptotic normality of Theorem 13. II can be applied. 

In Theorem 14. II the local asymptotic normality was extended to the level of time evolution of the qubits 
interacting with a bosonic field. We have proven that at time t the joint state of the qubits and field is 

3=0,1/2 JC 

| e (z e - t / 2 ) J )( e (ze- t / 2 ) J |®| e (ze-"/ 2 x [ o, t ](«)))<e(z e -"/ 2 x [ o, 4 ]M)| 
+0(n"- 1 / 4 + e ,7i 3 "- 1 / 2+e ), 

for ||u|| < n v . The index j serves to remind the reader that the first exponential states live in different 
copies ^(C)^ of the oscillator space, corresponding to Tij via the isometry Vj. We will continue to 
identify Tij with its image in jF(C)j. 

We can now approximate the above state by its limit for large t, since 

exp(-|z| 2 )( e (z e -*/ 2 ),|j,j)(e(z e - tl / 2 XM ( U ))| e (z e -"/ 2 ))=cxp(-|z| 2 e -*). (5.1) 

As we are always working with ||u|| < n v , the only relevant z are bounded by n v+s for small 5. (The 
remainder of the Gaussian integral has an exponentially decreasing norm, as discussed before). Thus, 
for large enough time (i.e. for t > ln(n)), we can write p"(t) = p"(oo) + 0(n ,1 ~ 1 / 4+€ , n 3 ' ,_1 / 2+c ) with 

n/2 

PnM == Pn,u(j)|i,j)0',j|® 
3=0.1/2 

(5.2) 



— ^ / dze-l z -v^T« u | 2 /2 S 2 | e ( ze -«/2)^ e(ze -«/2 ) | exp( _| z |2 ) 
2tts2 J c 



Thus, the field is approximately in the state <p u depending on (u x ,u y ), which is carried by the mode 
(u i— > e~"/ 2 X[o,oo)( u )) <= L 2 (R) denoted for simplicity by e~"/ 2 . The atoms end up in a mixture of \j, j) 
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states with coefficients p n ,u{j), which depend only on u z , and are well approximated by the Gaussian 
random variable iV u as shown in Theorem 13.11 Moreover since there is no correlation between atoms 
and field, the statistical problem decouples into one concerning the estimation of the displacement in a 
family of Gaussian states <fi u , and one for estimating the center of N u . 

For the former problem, the optimal estimation procedure is known to be the heterodyne measurement 
[32l[55]; for the latter, we perform a "which block" measurement. These measurements are described in 
the next two subsections. 

5.1 The heterodyne measurement 

A heterodyne measurement is a "joint measurement" of the quadratures Q := (a + a*)/y/2 and P := 
— i(a — a*)/y/2 of a quantum harmonic oscillator which in our case represents a mode of light. Since 
the two operators do not commute, the price to pay is the addition of some "noise" which will allow for 
an approximate measurement of both operators. The light beam passes through a beamsplitter having 
a vacuum mode as the second input, and then one performs a homodyne (quadrature) measurement 
on each of the two emerging beams. If Q„ and P„ are the vacuum quadratures then we measure the 
following output quadratures Qi := (Q + Q v )/y/2 and P2 := (P — P v )/V2, with [Qi, P 2 ] = 0. Since the 
two input beams are independent, the distribution of \/2Qi is the convolution between the distribution 
of Q and the distribution of Q„, and similarly for y/2P-2- 

In our case we are interested in the mode e~"/ 2 which is in the state </> u , up to a factor of order 
( n r,-i/4+e^ n 3r,-i/2+ey From Qgj) we btain that the distribution of Q is iV(^2(2^- 1)^,1/(2(2^- 

1))), that of P is N(y / 2(2fi — l)u y , l/(2(2/i — 1))), and the joint distribution of the rescaled output 

((Q + Q»)A/2(2//-l), (P - P„)A/2(2At - 1)) , 

is 

N(u x , m/(2(2/x - l) 2 )) x N(u yi /x/(2(2/x - l) 2 )). (5.3) 

We will denote by (u x ,u y ) the result of the heterodyne measurement rescaled by the factor y/2p, — 1 
such that with good approximation (u x ,u y ) has the above distribution and is an unbiased estimators of 
the parameters (u x ,u y ). 

Since we know in advance that the parameters (u x , u y ) must be within the radius of validity of LAN we 
modify the estimators (u x , u y ) to account for this information and obtain the final estimator (u x , u y ): 

„ _ J Ui if \ui\ < 3n n , 
Ut ~ \ if |uj| > 3n" [bA) 

Notice that if the true state p is in the radius of validity of LAN around p, then ||u|| < n v , so that 
\ui — Ui\ < |ttj — Uj|. We shall use this when proving optimality of the estimator. 

5.2 Energy measurement 

Having seen the <^> u -part, we now move to the iV u -part of the equivalence between and 7V U ® <fi u . 
This too is a coupling to a bosonic field, albeit a different coupling. We also describe the measurement 
in the field which will provide the information on the qubit states. 

The final state of the previous measurement, restricted to the atoms alone (without the field), is obtained 
by a partial trace of equation (|5.2|) (for large time) over the field 

n/2 

t%= £ Pn.uOOIi. i><i,il + o(» n_1/4+e .n 3,, " 1/3+e )- 
3=0,1/2 



16 



We will take this as the initial state of the second measurement, which will determine j. 

A direct coupling to the J 2 does not appear to be physically available, but a coupling to the energy J z is 
realizable. This suffices, because the above state satisfies j — m (up to order 0(ti ,,_1/ ' 4+£ , n 3,,_1 / 2+£ )). 
We couple the atoms to a new field (in the vacuum state |f2)) by means of the interaction 

dU t = {J z {dA* t - dA t ) - \j 2 dt}XJ t , 

with J z :— Yj*k=i a z- Since this QSDE is 'essentially commutative', i.e. driven by a single classical 
noise Bt = (A$ — At)/i, the solution is easily seen to be 

U t = exp(J z ® (A* t -A t )). 

Indeed, we have df(Bt) — f'(Bt)dBt + kf"(B t )dt by the classical ltd rule, so that 

dexp(i,J z ® B t ) = {i.J z dB t - \ J 2 dt} exp(iJ z ® B t ) . 

For an initial state \j, m) ® |f2), this evolution gives rise to the final state 

U t \j,m)<g>n = \j,m) ®exp((m/\/n)(A* t - A t ))Q 
= \j,m) \(m/y/n)x[o,t])> 
where |/) £ JF(i 2 (R)) denotes the normalized vector exp(— (/, /)/2)e(/). Applying this to the states 

in T n y ields 

n/2 

UtT^®^U* t = PnAjUj){j,j\®\j I^X[oa)^/^X[oa\ + Oin^'^^n^ 1 ' 2 ^) . 

3=0,1/2 

The final state of the field results from a partial trace over the atoms; it is given by 

n/2 

E Pn,u(i)|(i/VH)x [ o,t])((i/^)X[o,t]| + 0(n"- 1 / 4+£ ) n 3 ''- 1 / 2+£ ). (5.5) 

7=0,1/2 

We now perform a homodyne measurement on the field, which amounts to a direct measurement of 
(At + A* t )/2t. In the state \(j /y/nx[o,t]}, this yields the value of j with certainty for large time (i.e. 
t > y/n). Indeed, for this state, E((A t + A*)/2t) = j/s/ri, whereas Vax(A t + A* t )/2t) = l/(4t). Thus 
the probability distribution p IljU is reproduced up to order 0(n' ,_1 / 4+E , ?t. 3j?— 1 / 2+e ) in L 1 -distance. 



The following is a remider from the proof of Theorem 13.11 If we start with j distributed according to 
Pn(j) and we smoothen — \/n(fi — 1/2) with a Gaussian kernel, then we obtain a random variable 
g n which is continuously distributed on R and converges in distribution to N(u z ,fi(l — n)), the error 
term being of order 0(?/'~ 1/ ' 2 ) + 0(n £_1 / 2 ). For j distributed according to the actual distribution, as 
measured by the homodyne detection experiment, we can therefore state that g n is distributed according 
to 

N(u z , m(1 - /*)) + 0(n" _1/4+e , n^- 1 ' 2 ^) + O^- 1 ' 2 ) + 0(n £ - 1 / 2 ). (5.6) 

As in the case of (u x ,u y ), we take into account the range of validity of LAN by defining the final 
estimator 

9n if Iff™ | < 3n" , , 

if| fln |>3n". ( } 

Similarly, we note that if the true state p is in the radius of validity of LAN around p, then ||u|| < n v , 
so that \u z — u z \ < \u z — u z \. 
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6 Asymptotic optimality of the estimator 



In order to estimate the qubit state, we have proposed a strategy consisting of the following steps. First, 
we use h := n 1 ~ K copies of the state p to get a rough estimate p n . Then we couple the remaining qubits 
with a field, and perform a heterodyne measurement. Finally, we couple to a different field, followed by 
homodyne measurement. From the measurement outcomes, we construct an estimator p n := p^/^a- 

This strategy is asymptotically optimal in a global sense: for any true state p even if we knew beforehand 
that the true state p is in a small ball around a known state po, it would be impossible to devise an 
estimator that could do better asymptotically, than our estimator p n on a small ball around p. More 
precisely: 

Theorem 6.1. Let p n be the estimator defined above. For any qubit state po different from the totally 
mixed state, for any sequence of estimators g n , the following local asymptotic minimax result holds for 
any < e < 1/12: 

limsup sup nR{p, p n ) < lim sup sup nR(p,g n ). (6-1) 

n->oo ||p-p ||i<n-V 2 +« n-Hx> \\ p - po || 1 <n- 1 / 2 +« 

Let (po, 1 — po) be the eigenvalues of po with po > 1/2. Then the local asymptotic minimax risk is 

limSUp SUp ni?(p, p„) = i? m inimax(Aio) = 8/io - (6.2) 

ra->oo ||/9-p || 1 <n- 1 / 2 + e 

Proof. We write the risk as the sum of two terms corresponding to the events E and E c that p n is inside 
or outside the ball of radius n~ 1 ^ 2+€ around p. Recall that LAN is valid inside the ball. Thus 

R(P, Pn) = E(||/5 - Pn\\ 2 lXE-) + - p n \\j \e), 

where the expectation comes from p n being random. The distribution of the result rho n of our mea- 
surement procedure applied to the true unknown state p depends on p. We bound the first part by R\ 
and the second part by i?2 as shown below. 

R\ equals P(E C ) times the maximum error, which is 4 since for any pair of density matrices p and a, we 
have ||p - cr\\i < 4. Thus 

i?i=4P(||p- / 5„|| 1 >n- 1 /2+^). 

According to Lemma 12.11 this probability goes to zero exponentially fast, therefore the contribution 
brought by this term can be neglected. 

We can now assume that p n is in the range of validity of local asymptotic normality and we can write 
p0n _ pU u i oca i parameter around p n . We get the following inequalities for the second term 
in the risk. 

m\p-pn\\iXE) 



< E 
< 



< 



< 



\\Pn-p\\i WPn-pWl^n- 1 ^ 

l|Pn-p||i Pn = P0 



sup E 

p-po||<"~ 1/2+e 



sup E p u (oo) 
Hp— Po II <n- 1 / 2 + e 



\\Pn ~ P\\l 



Pn = PQ 



SUP WPntt) ~ Pn(°°)h SU P \\pn ~ P\\l 

||p-po||<n- 1 '' 2 + £ 



SUp E pS(oo) \\Pn-p\\i 

p-p ||<n-V 2 +* 

l+2n 



cn 



sup 



Pn = PO 

p»(t)-p»(oo)\\i = R 2 . 



(6.3) 
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The first two inequalities are trivial. In the third inequality we change the expectation from the one with 
respect to the probability distribution of our data to the probability distribution Ppu^). In doing 

so, an additional term ||Pp«(t) — P p ^(oo)||i appears which is bounded from above by ||p"(£) — /o"(oo)||i. 
In the last inequality we can bound \\p n — p\\\ by cn~ 1+2ri for some constant c. Indeed from definitions 
(|5.4[) and (|5.7[) we know that \\p n — po\\i < c'nT 1 / 2 ^ and additionally we are under the assumption 
\\p- Polli < n~ 1/2+e with e < r\. 

For the following, recall that all our LAN estimates are valid uniformly around any state p° = p as long as 
/i — 1/2 > £2 > 0. As we are working with p different from the totally mixed state and \\p — p\\ < n~ 1 / 2+e , 
we know that for big enough n. p — 1/2 > e-i for any possible p. We can then apply the uniform results 
of the previous sections. 

The second term in i?2 is 0(n~ 5 / 4+3v+s ,n~ 3 / 2+5r,+s ) where S > can be chosen arbitrarily small. Indeed 
in the end of section [4] we have proven that after time t > Inn, the following holds: — p"(oo)||i = 

0(n _1 / 4+,,+ ' 5 , n - 1 / 2 + 3 '?+ l5 ). The contribution to nR(p,p n ) brought by this term will not count in the 
limit, as long as r/ and e are chose such that 1/12 > rj > e. 

We now deal with the first term in i?2- We write p in local parametrization around po — p as p Un /^- 
We have 

II* |i2 I, ||2 , K - + ( 2 M ~ 1) 2 (K ~ u x ) 2 + (u y - u y ) 2 ) 

\\Pn ~ Pill = llPu/Vn _ Pun/^lll - 4 ~ 

+ 0(||u-u„|| 3 n- 3 / 2 ). (6.4) 

The remainder term 0(||u — u„|| 3 n -3 / 2 ) is negligible. It is 0(n 3v ~ 3 / 2 ) which does not contribute to 
nR(p,p n ) for r] < 1/6. This is because on the one hand we have asked for \\p n — p\\ < n _1 / 2+c , and on 
the other hand, we have bounded our estimator u n by using (|5.4|) and ()5.7|) . 



We now evaluate E^u^) [d(u, u„) 2 ] with the notation 

d(u, v) 2 := 4 [{u z - v z ) 2 + (2p - \) 2 {{u x - v x ) 2 + (u y - v y ) 2 )} . (6.5) 

Note that the risk of u„ is smaller than that of u n (see discussion below (|5.4p and ). Under the 
law Ppu(oo) the estimator u„ has a Gaussian distribution as shown in (|5.3|) and (|5.6p with fixed and 
known variance and unknown expectation. In statistics this type of model is known as a Gaussian shift 
experiment [49]. Using (|5 .3[) and (|5.6p . we get Epu(oo) [(u z — ii z ) 2 ] < p(l~ p) and Bp,,^) [(u^ — Mi) 2 ] < 
p/(2(2p — l) 2 ) for i = x,y. Substituting these bounds in (16. 4|) . we obtain (|6.2[) . 



We will now show that the sequence p n is optimal in the local minimax sense: for any po and any other 
sequence of estimators g n we have 

i?o = limsup sup nR{p, g n ) > 8/io — 4/1q. 

n^oo ||p-p ||i<n- 1 /2 + £ 

We will first prove that the right hand side is the minimax risk i? m inimax(/^o) for the family of states 
N u (g) <fi u which is the limit of the local families of qubit states centered around po. We then extend 
the result to our sequence of quantum statistical models p". 

The minimax optimality for 7V U <g) U can be checked separately for the classical and the quantum part 
of the experiment. For the quantum part U , the optimal measurement is known to be the heterodyne 
measurement. A proof of this fact can be found in Lemma 7.4 of [22]. For the classical part, which 
corresponds to the measurement of L z , the optimal estimator is simply the random variable X ~ A^ u 
itself [49]. 

We now end the proof by using the other direction of LAN. Suppose that there exists a better sequence 
of estimators g n such that 

(p ) = 8p - 4p . 
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We will show that this leads to an estimator u of u for the family iV u ® 4> u whose maximum risk is 
smaller than the minimax risk -Rminimax(Mo), which is impossible. 

By means of a beamsplitter one can divide the state </> u into two independent Gaussian modes, using a 
thermal state <j> :— <fi° as the second input. If r and t are the reflectivity and respective transmitivity 
of the beamsplitter (r 2 + 1 2 = 1), then the transmitted beam has state = <p tu and the reflected one 
4>™ e f = ru - By performing a heterodyne measurement on the latter, and observing the classical part 
iV u , we can localize u within a big ball around the result u with high probability, in the spirit of Lemma 
12.11 More precisely, for any small e > we can find a > big enough such that the risk contribution 
from unlikely u's is small 

E(||u-u|| 2 X|| u _ fi || >a ) < e. 

Summarizing the localization step, we may assume that the parameter u satisfies ||u|| < a with an e loss 
of risk, where a = a(r,e). 

Now let n be large enough such that n e > a, then the parameter u falls within the domain of convergence 
of the inverse map S n of Theorem 13.11 and by (|3.9|) (with e replacing r\ and S replacing e) we have 

\\pf - S(N tu <8> (£ tu )||i < Cn- 1 / 4 ^ 5 , 

for some constant C. 

Next we perform the measurement leading to the estimator g n and equivalently to an estimator u n of 
u. Without loss of risk we can implement the condition ||u|| < a into the estimator u„ in a similar 
fashion as in (15. 4|) and (|5.7p . The risk of this estimation procedure for </> u is then bounded from above 
by the sum of three terms: the risk nR p (g n )/t 2 coming from the qubit estimation, the error contribution 
from the map S n which is a 2 n~ 1 / 4+c+<5 , and the localization risk contribution e. This risk bound uses 
the same technique as the third inequality of (|6.3[) . The second contribution can be made arbitrarily 
small by choosing n large enough, for e < 1/4. From our assumption we have i?o < RminimaxilMy} 
and we can choose t close to one such that Ro/t 2 < R m inimax(^o) an d further choose e such that 

-^o/^ ^ 6 <C Rjninimax 

In conclusion, we get that the risk for estimating u is asymptotically smaller that the risk of the 
heterodyne measurement combined with observing the classical part which is known to be minimax [22 . 
Hence no such sequence g n exists, and p n is optimal. 

□ 



Remark. In Theorem l6.il we have used the risk function R(p, p] — E(d 2 (p, p)), with d the Li-distance 
d(p, p) = ||p — p\\i. However, the obtained results can easily be adapted to any distance measure 
d 2 (pu,Pu) which is locally quadratic in u — u, i.e. 

d 2 (pu,Pu)= ^ 7a0(u a - U a )(up - Up) + 0(\\u ~ il\\ 3 ) . 
a,f3—x,y^z 



For instance, one may choose d 2 (p,p) = 1 — F 2 (p,p) with the fidelity F(p,p) :— Tr(\/ \fpp\fp). For 
non-pure states, this is easily seen to be locally quadratic with 

(2p ~ I) 2 

(2^ - I) 2 

l-(2,!o-l)* 

For the corresponding risk function Rp(p,p n ) := E(l — F 2 (p,p n )), this yields 

limsup sup tiRf(p, p n ) = Mo + 1/4 , (6-6) 

™~^°° l|p-po||i<ii- 1/2+e 
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with the same asymptotically optimal p. The asymptotic rate Rf ~ 4AI 4 ° n t " 1 was found earlier in [3], using 
different methods. 



7 Conclusions 

In this article, we have shown two properties of quantum local asymptotic normality (LAN) for qubits. 
First of all, we have seen that its radius of validity is arbitrarily close to n -1 / 4 rather than rT x l 2 . And 
secondly, we have seen how LAN can be implemented physically, in a quantum optical setup. 

We use these properties to construct an asymptotically optimal estimator p n of the qubit state p, provided 
that we are given n identical copies of p. Compared with other optimal estimation methods 4, 29J, our 
measurement technique makes a significant step in the direction of an experimental implementation. 

The construction and optimality of p„ are shown in three steps. 

I In the preliminary stage, we perform measurements of a x , a y and a z on a fraction h — n 1 ~ K of 
the n atoms. As shown in section [3J this yields a rough estimate p n which lies within a distance 
rt~ 1,/2+£ of the true state p with high probability. 

II In section [31 it is shown that local asymptotic normality holds within a ball of radius n~ 1 / 2 + T ' 
around p (n > e). This means that locally, for n — > oo, all statistical problems concerning the n 
identically prepared qubits are equivalent to statistical problems concerning a Gaussian distribution 
iV u and its quantum analogue, a displaced thermal state cj) u of the harmonic oscillator. 

Together, I and II imply that the principle of LAN has been extended to a global setting. It can now be 
used for a wide range of asymptotic statistical problems, including the global problem of state estimation. 
Note that this hinges on the rather subtle extension of the range of validity of LAN to neighborhoods 
of radius larger than nT 1 / 2 . 

Ill LAN provides an abstract equivalence between the n-qubit states P®"^ on the one hand, and on 
the other hand the Gaussian states iV u ® <j> u . In sections [4] and [5] it is shown that this abstract 
equivalence can be implemented physically by two consecutive couplings to the electromagnetic 
field. For the particular problem of state estimation, homodyne and heterodyne detection on the 
electromagnetic field then yield the data from which the optimal estimator p n is computed. 

Finally, in section [51 it is shown that the estimator p n , constructed above, is optimal in a local minimax 
sense. Local here means that optimality holds in a ball of radius slightly bigger than n -1 / 2 around any 
state po except the tracial state. That is, even if we had known beforehand that the true state lies within 
this ball around po, we would not have been able to construct a better estimator than p„, which is of 
course independent of po. 

For this asymptotically optimal estimator, we have shown that the risk R converges to zero at rate 
R(p, p n ) ~ 8m °~ 4 ^o ; with po > 1/2 an eigenvalue of p. More precisely, we have 

limsup sup nR(p, p n ) — 8po — 4/Jq- 

™^°° l|p-po||l<n~ 1/2+e 

The risk is defined as R(p, p) = E(d 2 (p, p)), where we have chosen d(p, p) to be the Li-distance \\p— p\\\ := 
Tr(|p — p|). This seems to be a rather natural choice because of its direct physical significance as the 
worst case difference between the probabilities induced by p and p on a single event. 

Even still, we emphasize that the same procedure can be applied to a wide range of other risk functions. 
Due to the local nature of the estimator p n for large n, its rate of convergence in a risk R is only sensitive 
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to the lowest order Taylor expansion of R in local parameters u — u. The procedure can therefore easily 
be adapted to other risk functions, provided that the distance measure d 2 (pu,p u ) is locally quadratic in 
u — u. 

Remark. The totally mixed state (p = 1/2) is a singular point in the parameter space, and Theorem 
13.11 does not apply in this case. The effect of the singularity is that the family of states p.6j) collapses 
to a single degenerate state of infinite temperature. However this phenomenon is only due to our 
particular parametrisation, which was chosen for its convenience in describing the local neighborhoods 
around arbitrary states, with the exception of the totally mixed state. Had we chosen a different 
parametrisation, e.g. in terms of the Bloch vector, we would have found that local asymptotic normality 
holds for the totally mixed state as well, but the limit experiment is different: it consists of a three 
dimensional classical Gaussian shift, each independent component corresponding to the local change in 
the Bloch vector along the three possible directions. Mathematically, the optimal measurement strategy 
in this case is just to observe the classical variables. However this strategy cannot be implemented by 
coupling with the field since this coupling becomes singular (see equation (|4.2[1 ). 

These issues become more important for higher dimensional systems where the eigenvalues may exhibit 
more complicated multiplicities, and will be dealt with in that context. 
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A Appendix: Proof of Theorem 13.11 

Here we give the technical details of the proof of local asymptotic normality with "slowly growing" local 
neighborhoods ||u|| < n n , with r] < 1/4. We start with the map T n . 

A.l Proof of Theorem I3.lt the map T n 

Let us define, for 0<e<(l/4 — rf) the interval 

Jn = [j ■ (M - l/2)n - n 1/2+e < i < (M - l/2)n + n 1/2+e } . (A.l) 

Notice that j S J n satisfies 2j > for all p — 1/2 > e 2 and n big enough, independently of p. 
Then J n contains the relevant values of j, uniformly for p — 1/2 > e 2 ; 

Mm PnAJn) = 1 - 0(n- 1/2+e ). (A.2) 

n — >oo 

This is a consequence of Hoeffding's inequality applied to the binomial distribution, and recalling that 
PnAl) = B(n/2 + j)(l + 0{7i- 1 ' 2 +')) for j e J n . 

We upper-bound \\T n (p™) — N u <K> </> u | by the sum 



3 E Pn,j 



N u -J2 Pn,u(j)r„ 

3&Jn 



+ sup \\V jP l n V* (A.3) 

1 
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The first two terms are "classical" and converge to zero uniformly over ||u|| < nT 1 : for the first term, this 
is (|A.2[) , while the second term converges uniformly on fi — 1/2 > t2 at rate n 1 * -1 / 2 [37]. The third term 
can be analyzed as in Proposition 5.1 of 22 : 



< 



^ u vS-|| 1 + II^ u --p^ u ^IIi. 



(A.4) 



where Pj :— VjV* is the projection onto the image of Vj. We will show that both terms on the right side 
go to zero uniformly at rate n _1 / 4+ ' ,+c over j £ J n and ||u|| < n v . The trick is to note that displaced 
thermal equilibrium states are Gaussian mixtures of coherent states 



1 



V2^ 



-|z-V2/i-la u r/2s 2 



\z)(z\)d\ 



(A.5) 



where s 2 := (1 - /i)/(4/x- 2). 

The second term on the left side of (|A.4[) is bounded from above by 

1 



-|z-V2/J-la u r/2s 2 



z)(z\-P j \z}(z\Pj\\ 1 d\ 



V^tts 2 

which after some simple computations can be reduced (up to a constant) to 



J e-l z l 2 / 2s2 ||^|z + v ^7^Ta u )||d 2 z. 



(A.6) 



We now split the integral, the first part is integrating over |z| > n v+s with < 5 < 1/4 — ry/2. The 
integral is dominated by the Gaussian and its value is 0(e~" *' + /( 2s )). The other part is bounded by 
the supremum over |z| < 2ri n+s (as ||u|| < n*) of ||P/|z)||. Now ||P/|z)|| < |zp'/VjT = 0( e -"( 1 /2-r,-25)^ 
uniformly on j G J n -, for any fi — 1/2 > €2 since then 2j > €2^. 

The same type of estimates apply to the first term 





Ad 




















+ 


Ad 













(v*^) - v*n 



< 



(A.7) 



The first term on the right side does not depend on u. From the proof of Lemma 5.4 of [22] . we know 
that 



\o° ■ - V*6°V-\\ < ( p2j+1 I- n 2j+1 \ 

\Pn,j v j <P v l\\ 1 b I 1 _ p2 j+l + P j 



with p = (1 — y«)//i. Now the left side is of the order p 2j+1 which converges exponentially fast to zero 
uniformly on /1 — 1/2 > £2 and j G J7„. 

The second term of (IA.7I) can be bounded again by a Gaussian integral 



V27TS 

where the operator A(u, z, j) is given by 



1 l e -M 2 /2, 2 || A(lw) || ld 2 Z; 



(A. 



A(u,z,j) := Ad \Uj (u/^)] (y/|z)(z|^) - K/ |z + v / 27^Ia u )<z + v / 27^1a u | Vj. 
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Again, we split the integral along ||z|| > n v+s . The outer part converges to zero faster than any power 
of n, as we have already seen. The inner integral, on the other hand, can be bounded uniformly over 
||u|| < n v , \x — 1/2 > £2 and j S J n by the supremum of || A(u, z, j)||i over |z| < 2n n+s , fx — 1/2 > 62, 
j S J„ and ||u|| < tP. 

Let z € K 2 be such that = z/y/2/u, — 1, and denote ip(n,j,v) — VjUj(v/^/n)\j,j). Then, up to a \[2 
factor, ||A(u, z, j)|ji is bounded from above by the 

\\ip(n,j,i) - |z)|| + 



t/j(n, j, u + z) - |z + \/2fj, - la u ) 



u 



u + z 



lii) 



(A.9) 



This is obtained by adding and subtracting \vp(n, j, z)) (ip(n, j, z)| and | ^(h, J, u + z)) (ip(n, j, u + z) | and 
using the fact that || (V 1 ! — = V^IIV' — 011 f° r normalized vectors ip,<j). 

The two first terms are similar, we want to dominate them uniformly: we replace u + z by z with 
Izl < 2n ri+s . We then write: 



(n,j, z)-|z}|| 2 = £|(£#(n,j,z))-(fc|z) 



fc=0 



r-l 



< £| (^KP)l - <fc|z>| 2 + 2^ (|(fc|^(n,j, z))| 2 + |(fc|z) 

fc— /c— r 



(A.IO) 



If z = \z\e ie then we have [29] 

(k\ip(n,j,z)) 



5in(|z|/^)e ie ) fc (cos(|z|^)) 



2j-k 



exp 



(2 M -l)|z| 2 \ (e sfl |z|V27^T) 



In (jA~T0| we choose r = n 2n+e3 with £3 satisfying the conditions 26 + 2r\ + e < 277 + 63 + 6 < 1/2 and 
77 + 63 < 1/4. Then the tail sums are of the order 



|z| 2 '' < [2n^) 2n2v+e3 



( n 2r,+ e3 )! 



k—r 



o(exp(-7i 2,?+e3 )) , 



J2\{k\4>(n,j,Z))\ 2 <J2( -^- 



< n-L-L— =o(exp(-7i 2 " +e3 )) . 



x n I (2j - k)\k\ ~ r\ 

k=r k=r v ; 

For the finite sums we use the following estimates which are uniform over all |z| < 2n r i+ s , k<r,je J n : 



-l/2+£+2r;+e3 



(sin(|z|/VH)) fe - (|z|/V^) fe (l + 0(n 4 "+ e3 + 25 - 1 )), 

(2M-1)N 2 



))• 



(cos(|z|/V^)) 



2j-fc 



exp 



(l + 0(n 2 ^ 2+e+2S )), 



where we have used on the last line that (1 + x/n) n — exp(x)(l + 0(n l ' 2 x)) for x < n 1 / 2 64 (cf. [37 ). 
This is enough to show that the finite sum converges uniformly to zero at rate 0(n 2r, ~ 1 / 2+€+e3 ) (the 
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worst if £3 is small enough) and thus the first second terms in (|A.9[) as the square root of this, that is 

Qf n ri-l/±+e/2+e 3 /2\ 

Notice that the errors terms depend on fi only through j, and that 2j > en for \i — 1/2 > 62. Hence 
they are uniform in /x. 

We pass now to the third term of (|A.9|) . By direct computation it can be shown that if we consider two 
general elements exp(iXi) and expiiX-i) of SU(2) with selfadjoint elements of M(C 2 ) then 

exp(-i(X 1 +X 2 ))exp{iX 1 )exp{iX 2 )exp{[X 1 ,X 2 }/2) = 1 + 0(X n X l2 X l3 ), (A.ll) 

where the O(-) contains only third order terms in X±, X 2 . If X\, X 2 are in the linear span of a x and a y 
then all third order monomials are such linear combinations as well. 

In particular we get that for z, u < n 17 " 1 " 63 : 

U{(3) := U ( - " ~L V ) U (-^=)U (-^=) exp(i(u x v y - u y v x )a z /n) 



n J \ y/n J \ y'n / 

1 + 0(n- 2+4l '+ 4e3 ) 0( n -3/2+3^+3e 3 ) 
( n -3/2+37;+3e3 \ l + ( n -2+4»7+4<:3 ' 



(A.12) 



Finally, using the fact that \j,j) is an eigenvector of L z , the third term in (|A.9|) can be written as 

llli,i)(i,i|-^(^)li,i)(i,il^(^ril 

and both states are pure, so it suffices to show that the scalar product converges to to one uniformly. 
Using (|A.12p and the expression of {j\Uj(j3)\j) [29] we get, as j < n, 

= [U(/3) hl Y = 1 + 0(n- 1+4 ^), 

which implies that the third term in (|A.9j) is of order 0(n~ 1+4,,+4c3 ). By choosing e$ and e small enough, 
we obtain that all terms used in bounding (|A.8|) are uniformly 0(n _1 / 4+I,+e ) for any e > 0. 

This ends the proof of convergence (|3.8p from the n qubit state to the oscillator. 
A. 2 Proof of Theorem HQ the map S n 

The opposite direction (|3.9p does not require much additional estimation, so will only give an outline of 
the argument. 

Given the state N u (g> (ft 11 , we would like to map it into p" or close to this state, by means of a completely 
positive map S n - 

Let X be the classical random variable with probability distribution 7V U . With X we generate a random 
j 6 Z as follows 

j(X) = [Vn~X + n( t i- 1/2)]. 

This choice is evident from the scaling properties of the probability distribution which we want to 
reconstruct. Let be the probability distribution of j(X). By classical local asymptotic normality 
results we have the convergence 

sup \\ql-pl\\x = 0{vP- 1 ' 2 ). (A.13) 

||u||<ni 

Now, if the integer j is in the interval J n then we prepare the n qubits in block diagonal state with the 
only non-zero block corresponding to the j'th irreducible representation of SU(2): 

'h 
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The transformation (f> u i— > r " • is trace preserving and completely positive [22) . 

If j ^ then we may prepare the qubits in an arbitrary state which we also denote by r" ■. The total 
channel S n then acts as follows 

n/2 

5„ : iV u ® 0" ^ r,V := </",-",- 

i=o,i/2 

We estimate the error — r"||i as 

Hp" - Clli < IK-pJJIIi + 2P p u(j £ + sup IK, - 

The first term on the r.h.s. is 0(n v ~ 1/2 ) (see (|A.13|) ). the second term is 0{n e ~ 1/2 ) (see (IA.2I0 . As for 
the third term, we use the triangle inequality to write, for j € J n , 

\Kj - Pijh < IIC - 17071I1 + 11*7^/ - a^-IIi • 

The first term is 0(e- n ^/ 2 -^- 2 ^), according to the discussion following equation (|A.6[) . The second 
term on the right is 0(n~ 1 ^ 4,+r,+e ) according to equations (|A.7[) through (|A.12[) . 

Summarizing, we have ||S'„(iV u ® U ) — i = 0(n _1 / 4+,,+e ), which establishes the proof in the inverse 
direction. 

□ 



B Appendix: Proof of Theorem 14.11 

First estimate. We build up the state pj n by taking linear combinations of number states \m) to obtain 
an approximate coherent state |z), and finally mixing such states with a Gaussian distribution to get an 
approximate displaced thermal state. Consider the approximate coherent vector Pm|z), for some fixed 
z G C and m = n 7 , with 7 to be fixed later. Define the normalized vector 



„Pm|z)|| ^ Vrn! 

1/11 rn— 

We mix the above states to obtain 



\/27rs 2 

Recall that s 2 = (1 - /i)(4^ - 2), and 



— f er^-^ 1 ^ 2 ^ 2 (\z)(z\) d 2 z. 



V2vrs 2 

From the definition of |?/»™ ■) we have 

z 



|||C J )-|z)||<^^L=A2, (B.2) 



which implies 



V7TS 2 7 \ \flfl\ J 
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for any e > 0, for any 7 > 2(ry + e). Indeed we can split the integral into two parts. The integral 



over the domain |z| > n ri+<i is dominated by the Gaussian factor and is 0(e 



). The integral 



over the disk |z| < n n+e is bounded by supremum of (|B.2[) since the Gaussian integrates to one, and is 



0(e ^l 2 n e ) n ' T ). In the last step we use Stirling's formula to obtain log (n 71+e ) n ~' /\fnF\ 



(77 + e 



7/2)n 7 logn. Note that the estimate is uniform with respect to [i — 1/2 > £2 for any fixed £2 > 0. 

Second estimate. We now compare the evolved qubits state pf n {t) and the evolved oscillator state 4> u (t). 
Let \ipm,j(t)) = Uj. n (t) \m) ® \ be the joint state at time t when the initial state of the system is |m) 
corresponding to | j, j — m) in the L z basis notation. We choose the following approximation of {ip^ j (t)) 



:=^c n (m,i)a i {t)\m-i)®\e- 1 / 2u X[o,t](u))i, 



(B.3) 



where a<(t) = exp((-m + i)t/2), c n (m,i) := c„(m,i - l)^ 23 2 " + ~ y / m with c„(m,0) := 1, and 
:= /®" as defined in (|4.1[) . In particular for /i — 1/2 > £2 and j E J n we have c n (m,i) < 



^(l + in-Va+Oi. 
We apply now the estimate (|4.5p . By direct computations we get 



^ m 

^ICjW) = -~^c n (m,i)a i (t)(m-i)\m-i)®\e- 1 / 2u X [o,t](u))idt 



i=0 



^^(m.ijaj-x^lm - i) <g> \e 1/2u X[o,t](«))i-i ® s \X[t,t+dt]) , 



(B.4) 



where 



»+i 



fc=i 



From the quantum stochastic differential equation we get 



1 



-^]c„(m,i)a,(t)(m- i) 



i=0 



2j — m + i + 1 
2X 



m-t)® |e 1/2 "x [ o,t](u)) l * 



(m — i)(2j — m + i + 1) 



(B.5) 



2jn(* + 1) 



m — i — 1 ) 



|e 1/2u X[o,t](M)) l ® s \X[t,t+dt])- 



In the second term of the right side of (|B.5|) we can replace c„(m, i)y ( m ' 2 j 2 ~(- H ^ t+1 ^ by c„(m, i + 1) 
and thus we obtain the same sum as in the second term of the left side of (IB. 41) . Thus 



G dt |C J W)-rf|C J W> = 

1 m_1 9( ' — ') 4- — ' — 1 

- c n (m, ? H(t)(m - i) U " Jj + m |m - i> ® le" 1 / 2 ^,^))* 



Then using c„(m,i) < + (2/£ 2 )n- 1 /2+e)« we ge t that ||<?dt£™ ~ ^CiWII is bounded from 

above by 



m— 1 



i=0 



^ 7 ((l+n-V»^(l-e-*))'e 



t ^ ip _ (ro _ i)t f (2(j„-j)+rn-z-l)(m-z) 

2j„ 



1/2 



dt. 
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We have 

(2(7„ -3) + m-i- l)(m - i) = 0(m(n - 1/2+ e + n - lfn)) 



2j„ 

Inside the sum we recognize the binomial terms with the m'th term missing. Thus the sum is 

(l + n-V 2+e - e-'n" 1 / 2 ^)" 1 - ((1 - e - { )(l + n^+^Y 
< (1 + rr 1/2+e ) ro (l - (1 - e~*) m ) < (1 + yi- 1 / 2 ^)™ me"'. 
Then there exists a constant C (independent of fi if // — 1/2 > e%) such that 

i/2 



By integrating over £ we finally obtain 

IICmO) " Cj(*)ll < Cm 3 / 2 ^- 1 ^ + rrm" 1 ) (l + |«" 1/2+£ ) 



m/2 



(B.6) 



Note that under the assumption 7 < 1/3 — 2e/3, the right side converges to zero at rate n 37 / 2 1 / 2 + e for 
all m < to = n 7 . Summarizing, the assumptions which we have made so far over 7 are 

2r/ + 2e < 7 < l/3-2e/3. 

Now consider the vector (i/'zj) as defined in (|B.1[) and let us denote \ip™ t j(t)) = Uj tn (t)\ip™ ) ® Then 
based on (|B.3j) we choose the approximate solution 

lffj(*)> = e- |z| " /2 E^E C «( TO > «*)l m - f > ® l e ~ 1/2u X[o,*] («)>i- 
m=0 Vm! i=0 

Note that the vectors ■(<)) and .(t)) live in the "fc-particle" subspace of Tij £g>.F(L 2 (R)) and thus 
are orthogonal to all vectors \ippj(t)) and |£™ -(t)) with p ^ k. By (|B.6|) , the error is 



(" l |_|2m / 9 \ ' 

m=0 TO ' \ e 2 / 



1/2 



77) ! 



m / 2 Ul2m 



l + ± n -V2+e) . (B.7) 

e 2 y m! 



We now compare the approximate solution ^j(t) with the "limit" solution ip z (t) for the oscillator 
coupled with the field as described in section [4~2l We can write 



Im 



Mt) = z~ £ 7= E \/ ( • >- (ro - i)i/2 |m - *) le^/^Xlo.tjM) 

m— i—0 V ^ ' 

Then 

llffj(*)-^(*)ir = 

c^O-t/f?) (l-e-«)' + e-N" £ 



e -M* V" IE! y^ e -(m-i)t 

^ to! ^ 

m— i—0 



TO! 
m— m 
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Now 



c n (m,i) 



< 



< 



c n {m,if 



i-n(i+ 

P =i 



2 0' -jn)-m+p 

2jn 



< C2| . I'/'" 



-1/2+e 



where C2 does not depend on /i as long as — 1/2 > e 2 (recall that the dependence in /i is hidden in 
j n = (2/i - l)n). Thus 



"<N Jl " l z l 2 "< C72n - 1 / 2+1z|2 l« 



2m 



rn I 



7,1 



(B. 



m=0 



From (|B~7)) and (|B~8|) we get 

ll^2j(*)-^(*)ll < 
2 A 



£2 



/ •> \m/2 |„|2m 

(7m 3 / 2 (n- 1 / a+e + mn" 1 ) ( 1 + -n" 1 / 2 ^ ) + ^_ + 
E(fh, n, z) 



//). ! 



m' 



1/2- 



We now integrate the coherent states over the displacements z as we did in the case of local asymptotic 
normality in order to obtain the thermal states in which we are interested 



p7 



1 



We define the evolved states 



e -| Z -« u |-/2^| Cj)(C J d 2 



Then 



PlnV) ■= and u (t) := U {t)<f> n U {tf , 

sup sup ||p" n (i)-0 u (t)||i< sup [ e-\ z -^^ Ia ^ 2 / 2s2 E(rh : n,z)d 2 z. 

jeJn ||u||<ni ' ||u||<ni V7TS 2 7 



Here again we cut the integral in two parts. On |z| > n' ,+c , the Gaussian dominates, and this outer part 
is less than e~ n . Now the inner part is dominated by sup| z |<„,,+ e E{rh, n, z). Now we want rh to be 
not too big for (|B.7|) to be small, on the other hand, we want z 2m /m! to go to zero. A choice which 
satisfies the condition is 7 = 2ij + 3e. By renaming e we then get 

E(m,n,z) = 0(n"- 1 / 4+e ,7i 3 "- 1 / 2 + e ), 

for any small enough e > 0. Hence we obtain (|4.6|) . 

□ 
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